We propose a system able to synthesize automatically a classification model and a set of interpretable decision rules defined over a set of symbols, corresponding to frequent substructures of the input dataset. Given a preprocessing procedure which maps every input element into a fully labeled graph, the system solves the classification problem in the graph domain. The extracted rules are then able to characterize semantically the classes of the problem at hand. The structured data that we consider in this paper are images coming from classification datasets: they represent an effective proving ground for studying the ability of the system to extract interpretable classification rules. For this particular input domain, the preprocessing procedure is based on a flexible segmentation algorithm whose behavior is defined by a set of parameters. The core inference engine uses a parametric graph edit dissimilarity measure. A genetic algorithm is in charge of selecting suitable values for the parameters, in order to synthesize a classification model based on interpretable rules which maximize the generalization capability of the model. Decision rules are defined over a set of information granules in the graph domain, identified by a frequent substructures miner. We compare the system with two other state-of-the-art graph classifiers, evidencing both its main strengths and limits.

Granular computing techniques for classification and semantic characterization of structured data / Bianchi, FILIPPO MARIA; Scardapane, Simone; Rizzi, Antonello; Uncini, Aurelio; Sadeghian, Alireza. - In: COGNITIVE COMPUTATION. - ISSN 1866-9956. - STAMPA. - 8:3(2016), pp. 442-461. [10.1007/s12559-015-9369-1]

Granular computing techniques for classification and semantic characterization of structured data

BIANCHI, FILIPPO MARIA;SCARDAPANE, SIMONE;RIZZI, Antonello;UNCINI, Aurelio;
2016

Abstract

We propose a system able to synthesize automatically a classification model and a set of interpretable decision rules defined over a set of symbols, corresponding to frequent substructures of the input dataset. Given a preprocessing procedure which maps every input element into a fully labeled graph, the system solves the classification problem in the graph domain. The extracted rules are then able to characterize semantically the classes of the problem at hand. The structured data that we consider in this paper are images coming from classification datasets: they represent an effective proving ground for studying the ability of the system to extract interpretable classification rules. For this particular input domain, the preprocessing procedure is based on a flexible segmentation algorithm whose behavior is defined by a set of parameters. The core inference engine uses a parametric graph edit dissimilarity measure. A genetic algorithm is in charge of selecting suitable values for the parameters, in order to synthesize a classification model based on interpretable rules which maximize the generalization capability of the model. Decision rules are defined over a set of information granules in the graph domain, identified by a frequent substructures miner. We compare the system with two other state-of-the-art graph classifiers, evidencing both its main strengths and limits.
2016
granular computing; automatic semantic interpretation; frequent substructures miner; graph matching; graph classification; evolutionary optimization; watershed segmentation
01 Pubblicazione su rivista::01a Articolo in rivista
Granular computing techniques for classification and semantic characterization of structured data / Bianchi, FILIPPO MARIA; Scardapane, Simone; Rizzi, Antonello; Uncini, Aurelio; Sadeghian, Alireza. - In: COGNITIVE COMPUTATION. - ISSN 1866-9956. - STAMPA. - 8:3(2016), pp. 442-461. [10.1007/s12559-015-9369-1]
File allegati a questo prodotto
File Dimensione Formato  
Bianchi_Granular_2016.pdf

solo utenti autorizzati

Note: Granular Computing Techniques for Classification and Semantic Characterization of Structured Data
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.14 MB
Formato Adobe PDF
2.14 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/843415
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 18
social impact